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1 . SCOPE 


This document contains a design specification for software to 
implement an equiprobable blocks progreun system. This system 
will, on option, perform the following functions. 

a. Conditional probability functions 

b. Classification 

c. Dot selection 

d. Block population comparison 

Training fields will be user-provided. Test and sample fields, 
also user-provided, are used in classification and comparison 
options . 

The equiprobable blocks program will be added as a processor to 
the EOD-LARSYS pattern recognition system. It will be implemented 
on the Purdue-LARS computer (IBM 370/148). (Conversion of the 
EOD-LARSYS system from the UNIVAC EXEC 2 system at JSC to Purdue- 
LARS is currently under way.) 


/ 



2 . APPLICABLE DOCUMENTS 


• Final Design Specification for EOD-LARSYS Procedure 1, Houston , 
Texas, August 1977, JSC-12742, LEC-10417. 

• Job Order 63-1347-1695 

• TIRF 

• M. P. Gessamon, "A Consistent Non-Parametric Multivariate 
Density Estimator Based on Statistically Equivalent Blocks", 

Ann. of Math Stat. , Vol. 41, No. 4, pp. 1344-1346, 1970. 

• M. P. Gessamon and P. H. Gessamon, "A Comparison of Some 
Multivariate Discrimination Procedures", J. of Am. Stat. Assoc., 
Vol. 67, No. 338, pp. 468-472, 1972. 





3. SYSTEM DESCRIPTION 


3.1 HARDWARE DESCRIPTION 

N/A 

3.2 SOFTWARE DESCRIPTION 

The Equiprobable Blocks processor will proceed computationally 
along the following path. 

Read block pareuneters, action options, features, spatial grid 
and parameter D (option) . Read training fields for each category 
(class) defined. Read test fields, seunple fields. For each 
category, build the set of equiprobable blocks. This procedure 
is described in detail later in this section. On option, read a 
ground truth file, assign categories to training field pizels. 

Depending upon action options, oerform one or more of the follow- 
ing: 

a. Compute conditional probability density estimate, f(z|C^), 
j=l, NC where z belongs to one of the training fields of 
category Cj and Cj is the jth category. Output these 
estimates along with training field geometry, features, 
category names, and so on. 

b. Classify the sample fields. This procedure is described in 
detail later in the section. If there are more than two 
categories defined, successive NC-1 sets of blocks are com- 
bined to obtain the "best” set of blocks for classification. 
Write a classification map file (MAPTAP) . 

c. Select dots based on an input spatial grid and maximum count 
parameter D. The equiprobable block set used will be a 
combination of blocks of all categories specified. Output 
the dots selected. 



d. Combine equiprobable block sets from blocks from each category 
to form a combination block set. Compute the proportion of 
pixels per category per block and output these results. 

In all cases where test fields are input, compute the number of 
pixels per category per block and output. In this way, if 
training fields for categories are also provided as test fields, 
a comparison of categories can be achieved. 


Certain details of the computation procedure follow. 
COMPUTATIONAL PROCEDURE: BLOCKS 


Read training field pixels per category, sun angle correct on 
option. 

Construct a set of equiprobable blocks for each category j , 
j=l, NC; 

First compute block parameters 


Option “ compute 



P = number of selected features 

where = number of pixels in training fields for 
category j 


Kj - minimum number of pixels per block 

number of subdivisions in each co-ord direction i 
for each category j 


i 


Option - compute 



Option - compute 



where (Q^j independent of i) 
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3f Mj ■ If 


In order to know which pixels go into the block divisions, an 
ordering on the first co-ordinate must be made. This will be 
the time-consuming part of the computation. The pixels 
involved will belong to the training fields of category j. 


This ordering process also determines the co-ordinate valve 
of the cut. The cuts must be made so that no two are at the 
seune location. (The original requirements document resolves 
this. ) 


i-2: X 

where X 


^2j°2j ®2j 

■ + 1 for the first pixels 

= r^j for the others 


Compute r 2 j = FIX I ^ — 


(*) 


S 


2j 


X - 




The first S 2 j secondary divisions in each strip will contain 
r 2 j + 1 pixels, (upper or "right" boundaries included) , the 
rest will contain r 2 j pixels. 


Example 


co-ordinate 2 


255 


co-ordinate 1 


255 




i-3, . . . P 

This process should be continued for . . .P. At each 

stage, the pixels must be ordered in the 1th co-ordinate 
(channel) for bloc)c division inclusion determination. 

P 

There will be n Q . . blocks at the conclusion. 
i»l 

The volume of each block must be computed and stored. These 
are computed from block boundaries which have been st^)red. 


Once the blocks have been formed for each category, the 
options are exercised. 


Option to compute conditional probability density estimates 
TmT+TTvolT 


where: Z belongs to the sample fields, 

Mj = number of pixels in category j training 
fields, 

VOLj = volume of the block of category j 
containing the pixel Z. 


Output these estimates. 


Option to classify the test fields : 

The "best” set of blocks must first be determined. In the 
case where only 2. categories are specified, the best set is 
determined as follows. 

Using the training fields of category 2 as test fields, 
classify these fields using the blocks of category 1. Compute 
the proportion of pixels misclassif ied. Next, take the 
training fields of category 1 as test fields and classify 
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using the blocks of category 2. Compute the proportion mis- 
classified. Select the block set yielding the smaller 
proportion of misclasslflcatlon for use In classification of 
sample fields. 

In the case of more than 2 categories (N0CAT>2) , successive 
combinations of NOCAT-l block sets are used to classify the 
remaining category training fields and the proportion of 
misclassif ication is computed. The block combination of 
blocks from NOCAT-1 block sets yielding the smallest pro- 
portion of misclassif ication is used in classification. 

The process of classification is as follows. Given a set 
of blocks and seunple fields, for each pixel Z in the sample 
fields, determine which block contains it. Next, for each 
category, determine the proportion of pixels from the train- 
ing fields in that block. The category having the largest 
proportion will claim the pixel Z. In case of ties, pixels 
will be allocated among the categories involved in the ties. 

These classification results will be outputted under the 
EOD-LARSYS MilPTAP format for subsequent display. 

Option to select dots : 

Here the combined set of blocks over all categories is uti- 
lized. Normally, only one category will have been defined 
under this option. (The equiprobable blocks will be built 
from all the training fields.) The sample field provided 
will be overlaid with the spatial grid provided. The D 
(or fewer) grid intersections contained in each block will 
be selected. If there are more than D intersections in a 
particular block, the D dots closest (using L 2 distance) to 
the center of the block will be chosen. If no grid is provided, 
the D pixels closest to the center of each block will be selected. 



The spatial co-ordinates of the set of selected dots will be 
outpu “<>d , 

Option to combine category training sets ; 

All training fields will be combined to form an equiprobable 
block set. The proportion of pixels in each block from each 
training category will be determined and outputted. Also, 
the proportion for test field categories will be determined 
and outputted. 

3.2.1 LINKAGES 

The Equiprobable Blocks processor will use the FORTRAN IV-G 
compiler, IBM system routines, EOD-LARSYS utility routines, and 
EOD-LARSYS common blocks GLOBAL, ISOLNK and INFORM. A new 
common block EQUPRB will be created for other transfers of data 
among subprograms. 

3.2.2 INTERFACES 

The EquiprobeUsle Blocks processor will require as an input file 
an MSS data tape (ERIPS-type unload tape) . It will be invoked 
from the MONTOR routine of EOD-LARSYS upon encountering a 
control input card of the form SEQUPRB. Back-to-back runs with 
the DISPLAY processor of EOD-LARSYS will be possible. 

3.2.3 INPUTS 

a. Processor card 


Keyword 

SEQUPRB 


b. Nei» Control Cards 


Ksyword 

Parametsr 

Function 

CHANNEL 

DATA«C^ , Cj , . . . Cj,Qpg^2 
(Dafaultx 1,2,..., 16) 

Channel numbers to be used. 
These must be a subset of 
channels available on the 
MSS data tape. 

NCPASS 

N 

(Default: N«4) 

Number of channels per 
pass. Used to group 
channels into passes 
(acquisitions) . 

SUNANG 

TAPE 

or 

Si,.. .Sj^A 

Sunangles for sun angle 
correction. If TAPE 
parameter specified, sun*- 
angles are extracted from 
the MSS Universally for- 
matted data tape. 

DATAFI 

INPUT/UN IT-N , F ILE-M 
(Default: N-3, M-i) 

Unit and file numbers for 
an MSS data tape. 

FORMAT 

MAPTAP 

U or F 

OUTPUT/UN IT-N , FILE-M 
(Default: N-2, M-1) 
(Card ignored If 
classification option 
not specified) 

Universal or LARSYS III 
Unit number and beginning 
file number for the optional 
classification map file 
group. 

DSTTAP 

OUTPUT/UNIT-N , FILE-M 
(Default: None) 

(Card ignored if DST 
output option not 
invoked) 

Unit number and beginning 
file number for the optional 
output of conditional 
probability density esti- 
mates . 



Keyword 


Function 


GTTAP 


GTVOTE 


GTNAM 


Pargjieter 

INPUT/UNIT=N , FILE«M 


N 


, CATNAMj^ , . . . 
^N(j)CAT' ^^™^(|)CAT 


Unit and file numbers of 
grovind truth tape 

N=1 designator for strategy 
(to be provided) for assign- 
ing pixel category number 
from subpixel category 
niimbcrs (used it numbers 
are not all the same) 

Category names corresponding 
to ground truth numbers 





Keyword 


Parameter 


£ unction 


CATNAM 


BLOCK 


DOTGRD 


DOTMAX 


OPTION 


Category name 
category name 


» • • • » 

NOCAT 


Category names. 

These names should match 
those specified in CLASSNAME 
field description cards 
appearing after control 
card input. 


Q/{Qij} i=l,N0FET2, Q block option. 

j=l, NOCAT 
or 

K/fKj} j=l, NOCAT K block option. 

If card omitted or 
parameter field blank, 
default block option 
is used. 


X/Xj^ 

. . . ^YdotY 

(Default: Pick D pixels 
closest to the center 
each block) . 

D 

(Default: D=l) 

CNDPRB 

CLASS 


Spatial dot grid. If dot 
option not invoked, this 
card(s) is ignored. 


Maximum number ot dots 
per block. 

Compute conditional 
probabilities . 

Compute conditional 
probabilities and classify 
test pixel fields. 


DOTS 


GT 


Select dots using spatial 
grid overlay on test field. 

Use ground truth information 
to label training field 
pixels. 




Keyword 


Parameter 


Function 


PRINT 


COMB IN 


PASS 


TEST 


PROJ/{{c^,c.)} 
(Default; Print all 
2“D cuts) 

CNDPRB 


Combine all category 
training fields to make 
a combined block set. 
Compute proportion of 
pixels per category per 
block for training fields, 
test fields. 

Perform all operations 
separately by pass, create 
output files by pass. Used 
for providing input to 
the multi-temporal Bayes 
classifier. The number of 
acquisitions is computed 
from CHANNEL and NCPASS 
control card information. 

Create blocks for each 
category. Do not compute 
conditional probability 
density estimates . Compute 
proportion of pixels in 
test fields in each block 
of each category. Also, 
include pixel category 
numbers in appropriate 
positions in each two cut 
subdivisions . 

Two-dimensional cuts of 
blocks to be printed. 


Print conditional 
probability density 
estimates. 


Keyword 


Parameter 


Function 


CLSMAP 


♦END* 


Print classification map. 

NOTE: All results from 

DOTS, COMB IN, and/or 
TEST options are printed. 

Terminates control card 
input . 


c. Field Cards 

Field cards will be handled exactly as for EOD-LARSYS, 
except that DESIGN header cards will include TRAIN, SAMPLE, 
and TEST designators, as well as OTHER and UNIDENTIFIABLE. 

3.2.4 OUTPUTS 

Files MAPTAP and/or DSTTAP are output where appropriate. 

3.2.5 STORAGE REQUIREMENTS 

TBD 


3.2.6 DESCRIPTION 


The set of subprograms outlined below will provide the processor 
building blocks. 


Subprogram 

EQUPRB 

SET15 

RDDAT 


Function 
Main driver. 

Reads control cards, set data and options. 

Reads training/test/sample fields, establish 
field vertices and enclosing rectangles. 

Reads MSS data tape, extracts radiance valves 
for selected channels, perform sun angle 
correction. 



Subprogram 

BLOCK 

COMBIN 

CLSTST 

SLTDOT 

PIXPRO 


PRPROJ 

PRDOTS 

PRMAP 

PRBLK 

PRCND 

WRTCND 

WRTMAP 

GTASG 


Function 

Builds blocks for specified category. 

Combines blocks from specified categories. 

Classifies sample pixel fields. 

Performs dot selection. 

Computes proportion of pixels from each 
training category and for test fields in 
each block of specified block set. 

Print 2-D cuts of specified block set. 

Print spatial co-ordinates of selected dots. 

Prints classification map. 

Prints specified block set information. 

Prints conditional probability density 
estimates . 

Writes file of conditional probability 
density estimates. 

Writes classification map. 

Read ground truth tape, assign category number 
and hence name to training field pixels, thereby 
breaking up the training fields by category. 

For any set of six subpixels of ground truth 
corresponding to a training field pixel, if 
the six subpixels do not all have the same 
category number, a strategy will be selected 
to choose a number to label the training field 
pixel. 



Common block /GLOBAL/ may be expanded to include the unit and 
file number of DSTTAP. Common block /ISOLNK/ will pass the sun 
angle information. A new common block /EQUPRB/ will pass block, 
data, and option information among subprograms. 

The overall architecture will follow that of EOD-LARSYS as 
closely as possible. 


